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Noise Spectrum Subtraction Method and System 



FIELD OF THE INVENTION 

This invention is in the field of noise subtraction techniques, and relates to a 
noise spectnam subtraction method and a voice-processing unit utilizing the same 
for use in a voice operated system. 

BACKGROUND OF THE INVENTION 

Voice operated systems are typically utilized in commumcation devices, 
such as phone.devices and computers, as well as in toys. These systems typically, 
comprise such main constructional components as an A/D converter for receiving 
an input analog voice signal, a vocoder, an operating system, a communication 
interface associated with an output port, and a voice recognizer (typically, 
implemented as a separate DSP chip). 

During a transmission operational mode of the communication device (e.g., 
mobile phone), tiie input analog voice signals (e.g., generated by a microphone) are 
digitized by the converter. In the conventional devices, the digitized vpice signals 
are supplied to the vocoder for compression of the voice samples to reduce the 
amount of data to be transmitted through the interface nnit to anotiier 
communication device (e.g., mobUe phone)-, and are concurrently supplied to the 
voice recognizer. The latter receives the digitized voice samples as input, 
parameterizes the voice* signal and matches the parameterized input signal to 
reference voice signals. The voice recognizer typically either provides the 
identification of the matched signal to the operating system, or, if a phone number 
' is associated with the matched signal, provides Hie associated phone number. 
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A technique utilizing the application of a voice recognition fiinction to a 
cotnpressed digitized signal has been developed and disclosed in US Patent No. 
6,003,004 assigned to the assignee of the present aj)plication. 

It is a well-known problem of voice operated systems that background noise 
added to speech can degrade the performance of digital voice processors used for 
speech compression, recognition, authentication, etc. Thus, to improve the quality 
of voice recognition, it is necessary to reduce the background noise in a speech 
signal. 

Various noise reduction techniques have been developed and disclosed, for 
example, in the article S.F. Boll "Svppression of Acoustic Noise in Speech Using 
■ Spectral Subtraction", IEEE Transactions m Acoustics, Speech and Signal 
processing, 1979, V. 27, N. 2, pp. 113-120. According to the known techniques, 
the noise suppression of the digital signal is ^ically carried out before the signal is 
supplied to the vocoder (i.e., prior to signal compression). This approach is 
therefore computationally intensive and slow. This is a serious drawback when 
dealing with mobile phones, since the processing requirements of noise suppression 
and voice recognition pose a severe processing load on the mobile phone and may 
obstruct its operation. It is known to use an additional DSP chip for noise 
suppression. 

SUMMARY OF THE INVENTION 

There is therefore a need in the art to facilitate noise reduction in voice 
operated systems by providing a novel noise spectrum subtraction method and a 
voice processing xmit utilizing the same. 

The main idea of the present invention consists of applying a noise 
reduction to a digital signal representative of a voice signal, after the digital signal 
being compressed. This simplifies the computation. 

There is thus provided according to one aspect of the present invention, a 
method for reducing noise in a voice signal, the method comprising the steps of: 
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(i) processing a compressed digitsd signal representative of the voice 
signal including a speech component and a noise component; and 

(ii) determining, the noise component to be subtracted from the 
compressed digital signal. 

In a preferred embodiment of the invention, the compressed digital signal is 
based on a set of linear prediction coding (LPC) coeflEicients and a residual signal, 
and is obtained by applying LPC analysis to the voice signal. To this end, a digital 
signal may be divided into a series of frames representative of the voice signal 
including a speech cc»nponent and a noise component to be subtracted. The frame 
may, for example, represent about 20msec of the digital signal. Preferably, the 
frame is composed of M digitized speech samples, and the set of LPC coefficients 
contains p coeflScients, such that the ratio p/M is in the range of 0.1-0.25. LPC 
analysis is applied to all frames, thereby obtaining the compressed digital signal 
representative of the voice signal 

Preferably, the processing of the compressed digital si©ial is based on the 
following: determination of a power spectrum of the noise component during a 
non-speech activity and calculation of its average value, calculation of a power 
spectrum estimator of the compressed digital signal with a reduced noise 
component, determination of an autocorrelation ftmction of this signal, and 
determination of modified LPC coefficients. The modified LPC coefficients 
represent the speech component with the reduced noise spectrum. To determine the 
noise spectrum, a calculation involving a Fourier transform can be applied to the 
compressed digital signal. ' To determine the autocorrelation fijnction of the 
compressed digital signal with the reduced noise component, an inverse Fourier 
transform may be applied to the estimated power spectrum of the signal with the 
reduced noise component. 

According to another aspect of the present invention, there is provided a 
voice processing unit for use in a voice operated system, the voice processing unit 
comprising a noise reduction utility interconnected between a voice coding utility 
and a voice recognition utility, the noise reduction utility being operable for 



processing a compressed digital signal representative of an input voice signal 
received from the voice coding utility and generating an output compressed digital 
signal with reduced noise spectrum. 

According to yet anotiier aspect of the present invention, there is provided a 
voice operated system comprising an input port for receiving an input voice signal, 
an analog-to-digital converter for processing the input signal to generate a digital 
output indicative thereof, a voice processing utility ' for processing the digital signal 
and generating a compressed digital signal representative of the input voice signal, 
a voice processing unit, a system interface utility, and a control module, which is 
interconnected between the voice processing utility and the voice processing unit, 
and is connected to the system interface to operate it in response to a speech signal, 
the voice processing unit comprising: 

- a noise reduction utility coupled to the voice processing ijctility and 
operable to process said compressed digital signal and generate an 
output compressed digital signal with reduced noise spectrum; and 

- a voice recognition utility coupled to the noise reduction utility 
for processing said output compressed digital signal with 

. reduced noise spectrum. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In. order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompanying drawings, in which: 

Fig- 1 is a block diagram of a voice operated system according to the 
invention; and 

Fig- 2 is a flow chart of main operational steps of a voice processing unit of 
the system of Fig^ 1 . 



DETAILEP DESCRIPTION OF THE iNVEjVTION 

■ Referring to Fig. 1, there are illustrated the main components of a voice 
operated system 10 according to the invention (e.g., a mobile phone device). These, 
components include the following: an A/D converter 14 for receiving an analog 
voice signal coming from an input port 12 (e.g., a microphone), a system interface 
utility 20 associated with an output port (not shown), a voice processing utility 
(vocoder) 22, a voice processing unit 24, and a control unit (module) 26, which is 
interconnected between the vocodCT 22 and the voice processing unit 24, and is 
connected to the system interface utility 20. The voice processing unit 24 
comprises a noise reduction utility 28 coupled to the vocoder 22 through the control 
unit 26, and a voce recognition utility 29 coupled to the noise reduction utihty 28. 

The operation of the system 10 will now be described with reference to Fig. 
2- Initially, the A/D converter 18 converts the input analog voice signal into an 
output digitar signal, and supplies the digital output to the vocoder 22 (step 30). 
The vocoder 22 is operable by suitable software to compress the digital signal. 

In the present example, a voice compression algorithm based on LPC 
analysis is utilized. It should, however, be noted that any other suitable technique 
can be used for digital signal compression, for example, the voice quantization 
technique. 

Thus, in the present example, to compress the input digital signal, it is 
divided into a series of frames (step 32). Each frame contains M samples x(m), . 
wherein m=l,2,3,. . .,M/'arid typically represents 20msec of the input signal. 

The signal x(m) is typically a sum of a speech signal component, s(m), and a 
stationary additive background noise component, n(m):, which is to be reduced, that 
is: 

x(m)^s(m)'^n(m) (1) 

The vocoder performs LPC analysis on each frame and provides an ou^ut 
compressed signal thereof (step 34). Generally, the LPC analysis can be applied to 
at least some samples of at least one frame. 
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As a result, the given signal sample xCm) is represented in the following 

form:. 

x(m) = J^a,x(m-0 + s(m)=J^a,[s(m-0 + n(m-i)'} + s(m) (2) 

wherein a,- are the LPC coefficients and B(m) is a residual signal, all being the 
parameters of the frame. Each frame has LPC coefficients tf/. 

The vocoder further parameterize the residual signal ^(m) in terms of at 
least pitch and gain values (step 36). 

The above coding scheme usually results in a compression factor of 
approximately 8-11. The output of the vocoder 22 is supplied to the noise 
reduction utility 26 through the control module 26. The noise reduction utility is 
operable to determine a power spectrum of the noiSe component during a 
non-speech activity (step 38), and to remove the power spectrum of the noise 
component from the noisy speech signal. In the present example, the power 
spectrum of a signal xOn) is denoted by I XO^,J ^ and is calculated as follows: 

^(0), ) =SCa?„ ) + N{a>„ ) = H(oj„ ) • Eia>, ) 

^(^J=— tH- 

(3) 

wherein S(^nt). ^(^m)- and E(^n^ are Fourier transforms of s(mjy n(m) and ^(m), 
respectively. It should be noted that, for non-speech frames, X(<^n^'='N((^nt)^ ' 

In the present invention, it is assumed that the power spectrum of ^(m) is 
constant, i.e., I E(<^m) ' By using Parseval theorem, the value of Eo^ can be 

estimated as follows: 
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The noise reduction utility detennines the noise power spectrum I A'ifo)^ ( ^ 
during the non-speech activity and calculates its average value < I N((o„^ I ^ over 
non-speech frames (step 40), as follows: 



(5) 



Using the above expressions, the noise reduction utility 28 determines the 



In eq^tion (6), all the samples which are less than zero are replaced 

by zeros (clipping condition). It should be noted that §(<^n^ is advantageously 
based only on p LPC coefl&cients ai (p«M) and on the total energy of the residual 
signal. 

As known, for example, firom the disclosure in the following book: A.V. 
Oppenhein et al., ''Digital Signal Processing**, Prentice Hall, Inc., Englewood 
Cleef,, NI, 1975, p. 557, the inverse Fourier transfomi of §(<^n^ is the 
autocorrelation function r(n) of the signal, that reads: 



speech signal power spectrum estimator with reduced noise component (step 
42), as follows: 



S(a;J=|H(^.)r'^o'-M^J 



(6) 




-^Yj^(m)^s(jh-n) 
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Based on the above equation, the noise reduction .utility 28 detennines 
modified LPC coefiBcients dk (step 44), To implement this, any known suitable 



technique can be used, for example, those disclosed in the book: Rabiner et al., 
"Fundamentals of Speech Recognition", Prentice Hall, 1993, pp 97-121. The 
modified LPC coefficients dk represent the compressed digital signal with the 
reduced noise component. 
5 Thus, the noise recognition utility detennines the modified LPC coefficients, 

generates an output compressed digital signal mdicative thereof, and supplies this 
signal to the voice recognition utility 29, which utilizes the same for perfonnmg the 
voice recognition. 

It should be noted that the noise reduction utility 28 can also produce 
10 various LPC based parameters, such as cepstrum coefficients, MEL cepstrum 

coefficients, line spectral pairs (LSPs), reflection coefficients, log area ratio (LAR) 

coefficients, and the like. 

Those skilled in tiie art will readily appreciate that various modifications and 

changes can be applied to the preferred embodiment of the invention as 
15 hereinbefore exemplified without departing from its scope defiped in and by the 

appended claims. For example, any suitable technique can be used to determine 

modified LPC coefficients. The voice operated system utiHzdng tiie voice 

processing unit according to the invention may be of any suitable type, other than 

the mobile phone device described above. 



